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Abstract — In a CDMA cellular system, the process of the 
mobile station searching for a cell and achieving code and time 
synchronization to its downlink scrambling code is referred to 
as cell search. Cell search is performed in three scenarios: initial 
cell search when a mobile station is switched on, idle mode search 
when inactive, and active mode search during a call. The latter 
two are also called target cell search. This paper presents algo- 
rithms and results for both initial and target cell search scenarios 
for the Wideband CDMA (W-CDMA) standard. In W-<1>MA, 
the cell search itself is divided into five acquisition stages: slot 
synchronization, frame synchronization and scrambling code 
group identification, scrambling code identification, frequency 
acquisition, and cell Identification. Initial cell search needs all five 
stages, while target cell search in general does not need the last 
two stages. 

A pipelined process of the first three stages that minimizes the 
average code and time acquisition time, while keeping the com- 
plexity at a reasonable level, is considered. The frequency error in 
initial cell search, which may be as large as 20 kHz, is taken care of 
by partial symbol despreading and noncoherent combining. Op- 
timization of key system parameters such as the loading factors 
for Primary Synchronization Channel, Synchronization Channel, 
and Common Pilot Channel for achieving the smallest average code 
and time acquisition time is studied. After code and time synchro- 
nization (the first three stages), a maximum likelihood (ML)-based 
frequency acquisition method is used to bring down the frequency 
error to about 200 Hz. The gain of this method is more than 10 dB 
compared to an alternative scheme that obtains a frequency error 
estimate using din'erenttal detection. 

Index Terms — ^Acquisition, cell search, code division multiple ac- 
cess, frequency error estimation, spread spectrum communicatioii, 
synchronization. 

I. INTRODUCTION 

THIRD-GENERATION (3G) cellular systems are currently 
being standardized in various standardization bodies. The 
main objectives are to provide higher data rate services with im- 
proved service quality and capacity compared to the second- 
generation cellular systems. Among the 3G cellular systems 
being considered, IS-2000 [1] (in North America) and Wide- 
band Code-Division-Multiple-Access (W-CDMA) [1], [2] (in 
Europe and Japan) are both based on direct-sequence code-divi- 
sion multiple-access (DS-CDM A) technology. In these systems, 
spreading codes are used to differentiate physical channels from 
the same transmitter, and scrambling codes are used to differen- 
tiate transmitters. A mobile station (MS) needs to code and time 
synchronize to the scrambling code used by the serving cell be- 
fore any communications with the base station can take place. 
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One of the major differences between W-CDMA and IS-2000 
is that W-CDMA supports asynchronous base stations, whereas 
IS-2000 relies on synchronized base stations. With synchro- 
nized base stations, all cells (or sectors) can use shifts of the 
same scrambling code, so that a cell is identified by a imique 
code phase shift of the scrambling code. On the other hand, 
without time and friequency synchronization between base sta- 
tions, using different phases of the same code for scrambling 
is not sufficient to resolve the code ambiguity in the presence 
of time ambiguity. Thus, in an asynchronous CDMA system, 
cells can only be identified by using distinct scrambling codes. 
W-CDMA uses S12 downlink primary scrambling codes, al- 
lowing unique cell identification in every cluster of 512 cells 
[3]. The process of searching for a cell and synchronizing to its 
downlink scrambling code is often referred to as cell search. Cell 
search is necessary after the MS has switched on (initial search), 
and during idle and active modes, for identifying new camping 
cells or handover candidates, respectively. Idle and active mode 
search is also called target cell search. The performance of cell 
search impacts the perceived switch-on delay (initial search), 
stand-by time (idle mode search), and link quality (active mode 
search), and thus is important to MS design. 

In addition to code and time uncertainty, the degree of 
frequency uncertainty can be large during initial search. Mass 
produced consumer electronics commonly use inexpensive and 
rather inaccurate crystal oscillators. Such crystal oscillators 
have inaccuracies in the range of 3-13 ppm, giving rise to 
a frequency error in the range of 6-26 kHz, when operated 
at 2 GHz. Such a frequency error, if not corrected, results in 
severe degradation in receiver performance so that the receiver 
cormnunication functions foil. An important goal for the MS is 
thus to reduce its finequency error to a reasonable range during 
initial search so that further communication functions can take 
place. Cell search in W-CDMA deals with rather large code, 
time, and frequency uncertainties, and thus poses an interesting 
problem for system and MS design. 

This paper addresses algorithms for cell search in W-CDMA. 
Both initial and target cell search scenarios are studied. The 
frequency errors considered are 20 kHz and 0 Hz, respectively, 
for the initial cell search and target cell search. Instead of 
the traditional method sequentially searching through code, 
time, and frequency [4, pp. 223-225], the proposed method 
first acquires code and time synchronization assuming a large 
frequency error, and then performs frequency acquisition. The 
synchronization process in W-CDMA can tfius be viewed 
as the five stage process of 1) slot synchronization, 2) frame 
synchronization and scrambling code group identification, 3) 
scrambling code identification, 4) frequency acquisition, and 
5) cell identification. The combined goal of the first three 
stages is to deliver a reliable code- and time-candidate to the 
frequency acquisition stage (stage 4) with low delay and low 
complexity. To minimize the delay, we consider a pipelined 
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Fig. I , Frame and slot structures for CPICH, P-SCH. and S-SCH. 

process of these first three stages with all stages continuously 
ninning to produce results for the next stage. The effect of the 
large frequency error in these stages is reduced using partial 
symbol despreadmg with noncoherent combining. In addition 
the possibility of using coherent accumulation in stage 2 fframe 
synchronization and scrambling code group identification) 
IS exploited. We will show that using coherent accumulation 
m stage 2 si^ificantly improves cell search performance. 
We will also show that such a coherent detection scheme is 
robust even when the channel estimation is severely degraded 
by an interfering synchronization signal. For scrambling 
code Identification, we introduce a low complexity method 
consisting of symbol-by-symbol detection of the most likely 
scrambling code and then majority voting of the most likely 
saximblmg code over one or several frames. An advantage 
witii this proposed method is that the threshold needed in the 
majority voting can be determined analytically for a tarcet false 
detection probability. The average acquisition time for code 
and time synchronization is used to optimize the system and 
MS design parameters. We will show that by optimizing these 
parameters the acquisition time can be reduced significantly 
Once a highly reliable code and time candidate is found the 
firequency eiTor is estimated using a maximum-likelihood bksed 
low-complexity method. 

The paper is organized as follows. Section II describes syn- 
chronization channels and cell search procedures in W-CDMA 
In Section III, the system and signal models are given. Code and 
time acquisition is studied in Section IV. The issue of parameter 
optimization for achieving the best average acquisition time is 
addressed in Section V. In Section VI, frequency acquisition is 
investigated. Finally, conclusions are given in Section VII. 



11. Synchronization Channels and Cell Search 
Procedure 

In W-CDMA, a cell is identified mainly by its downlink 
scrambling code. There are 512 primary downlink scrambling 
codes reused throughout a system. These 512 codes are 
based on length 2^^ - 1 Gold sequences truncated to one 
frame interval, which is 38400 chips for the chip rate 3 84 
Mchips/s. To reduce the complexity of searching through the 
512 downlink primary scrambling codes, the concept of code 



grouping and the use of code group indicator codes (GIC) were 
introduced in [5] and [6]. The scrambling code is identified 
by first Identifying its code group to significantly reduce the 
degree of code uncertainty. The complexity of cell search is 
further reduced by combining code group identification and 
trame boundary synchronization into one stage [7] With this 
scheme, the time uncertainty is completely resolved when the 
code ^up identity is obtained. As a result, the complexity of 
identifying the scrambling code in the identified code grouo is 
significantly reduced. Schemes with further complexity reduc- 
tion by increasing the number of code groups were proposed 
in [8J. According to [3], the 512 downlink primary scrambling 
codes are divided into 64 groups, each of 8 codes. 

To facilitate cell search, three channels are used, namely the 
Pnmary Synchronization Channel (P-SCH). the Secondary Syn- 

(CPICH) [9J. The P-SCH together with the S-SCH are also re- 
ferred to as the Synchronization Channel (SCH) Fig 1 ilius 

^""^ ^""^ ^^"""^ ""^"^^^^ '^h^""^'^- Each frame 
of 38400 chips (or 10 ms) is divided into 15 slots, of each 2560 

^^^^^ ^*-SCH and S-SCH have 

a 1 0% duty factor. 

The CPICH, which is used to carry the downlink common 
pilot syrnbols, is scrambled by the primary downlink scram- 
bling code of the cell. Within each CPICH time slot, there are 1 0 
pilot symbols, each spread by 256 chips. All symbols are QPSK 
modulated, and the modulation values of the pilot symbols are 
known once the MS knows the fi^e boundary. The spreading 
sequence of CPICH is taken from the set of Orthogonal Vari 
able Spreading Factor (OVSF) codes [3], maintaining mutual 
Orthogonality between CPICH and the other downlink channels 
also spread by OVSF codes. 

Unlike CPICH, neither the P-SCH nor the S-SCH is scram- 
bled by the pnmaiy downlink scrambling code. Instead of the 
p sequences of length 256 chips are used. The 

P-5»CH sequence is transmitted once in the same position in 
every slot, and can thus be used for detecting the slot boundary 
Furthenmore, all cells use the same P-SCH sequence. As a re- 
sult, only one P-SCH matched filter is needed to detect the slot 
boundanes of downlink signals. To reduce the complexity of the 
P-SCH matched filter, the P-SCH sequence is derived from the 
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Kronecker product of two sequences of length 1 6 [ 1 0}. With this 
property, the P-SCH matched flher can be implemented as two 
concatenated matched filters, each matched to one of the two 
constituent length 16 sequences, achieving a complexity reduc- 
tion by approximately a factor of 8. 

The S-SCH is used to identify the frame boundary and 
scrambling code group identity. Unlike the P-SCH sequence, 
the S-SCH sequences vary from slot to slot. There arc 16 
S-SCH sequences, mapped correspondingly to 16 S-SCH 
symbols, labeled from 1-16. A frame (15 slots) of 15 such 
S-SCH symbols forms a codeword taken from a codebook of 
64 codewords. The same codeword is repeated every fhune in 
a cell. These 64 codewords correspond to the 64 code groups 
used throughout the system; thus a code group can be detected 
by identifying the codeword transmitted in every S-SCH 
frame. Furthermore, the 64 codewords are all chosen to have 
distinct code phase shifts, and any phase shif^ of a codeword 
is different from all phase shifts of all other codewords. With 
these properties, the frame boundary can be detected by 
identifying the correct starting phase of the S-SCH symbol 
sequence. To maximize the minimum symbol distance of the 
codebook, between different cyclic shifh of the same codeword 
or between any cyclic shifts of different codewords, the use of 
a Comma-Free Reed-Solomon (RS) code was proposed [11]. 
For 15 slots per frame, a (15, 3) Reed-Solomon (RS) code 
over GF(16) is used. The RS code has a minimum distance of 
13. Moreover, to minimize cross-channel interference, the 16 
S-SCH sequences and the P-SCH are mutually orthogonal [3]. 

Given the above Synchronization Channels and CPICH, code 
and time synchronization can be achieved by the following 
stages [12]: 1) slot boundary detection based on P-SCH (using 
a P-SCH matched filter); 2) frame boundary detection and 
scrambling code group identification based on S-SCH (using 
conelators, correlating against 16 S-SCH sequences, and an RS 
decoder); 3) scrambling code detection based on CPICH (using 
correlators, correlating against all scrambling codes in the 
identified code group). For initial search, the ultimate goal is to 
decode the cell identity of the acquired signal. To achieve this, 
two extra stages are needed: 4) frequency acquisition based 
on CPICH (to reduce initial frequency error so that the MS 
can decode the broadcast information); and 5) detecting cell 
identity (by reading the broadcast information). The frequency 
acquisition step is necessary because of the large frequency 
error after MS powered on. Without correcting the frequency 
error, ^e cell identity, transmitted in the Broadcast Channel, 
cannot be decoded reliably. For the target cell search, there 
is no ambiguity in the mapping from a downlink scrambling 
codes and the cell identity of a neighboring cell. Thus, identi- 
fying (and synchronizing to) the downlink scrambling code is 
sufficient to identify any given cell of interest 

We will focus on the code, time, and frequency acquisition 
aspects of cell search. Hence, stage 5 will not be addressed. 
Pipelined processing for stages 1 , 2, and 3 is considered, as illus- 
trated in Fig. 2. Stage 4 — frequency acquisition — is only acti- 
vated when time and code synchronization is achieved. To min- 
imize the delay in the pipe (no idle time in the pipe), the syn- 
chronization times used in stages 1 , 2, and 3 are the same (JVj 
slots). According to Fig. 2, stage 1 always generates a list of slot 
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Fig. 2. Cell search procedures. 

boundary candidates at the end of each cycle. Based on each of 
the slot boundaries detected in stage 1, stage 2 finds S-SCH and 
performs S-SCH correlations and RS decoding. At the end of 
each detection cycle, stage 2 always gives a list of candidates of 
frame boundary-code group pair to stage 3 for identification of 
the scrambling code. In contrast to stages 1 and 2, stage 3 only 
activates stage 4, when a candidate is detected with high confi- 
dence. The acquisition time for code and time synchronization 
is of interest According to Fig. 2, the acquisition time can be de- 
fined as the time interval between the time when the pipelined 
process started and the time when stage 3 terminates the process. 
If stage 3 went through cycles before it accepts a detected 
code and terminates the pipelined process, the acquisition time 
can be given by 



racq = (i^3 + 2)Mr.u,t 



(1) 



where T^iot is the slot duration. 



III. System AND Signal Model 

In this section, we describe system and signal models. First, 
to evaluate the performance of each of the individual cell search 
stages, a signal model as illustrated in Fig. 3 is used. The unity- 
power desired signal 34{t) is scaled with power and the 
unity- variance complex Gaussian noise n(t) is scaled by power 
Px- Hence, the signal-to-noise ratio (SNR) is given by Pd/Px- 
For stages 1 and 2, the desired signal is the Synchronization 
Channel (P-SCH + S-SCH); whereas for stages 3 and 4. the de- 
sired signal is CPICH. 

The term e'**'**+* in Fig. 3 is used to model the effect of 
a frequency error /«, with the phase 9 uniformly distributed on 
[0, 27r). The desired signal is passed through a time-varying fre- 
quency-selective fading channel with impulse response ^(r; £), 
before the impairment component n(t) is added. The impulse 
response of the &ding channel is given by 

9(r:t) = Y^gf{t)6iT-nit)) (2) 
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Fig. 3. The signal model used for evaluating the performance of each of the 
indtvidual cell search stages. 

where 

L is the number of resolvable multipaths, 

9iit) is the complex gain of the /th path at time t, and 

Tj(t) is the delay of the /th path at time t. 

Since the delays of multipaths vary slowly with time, they are 
considered constant in this paper, i.e., ri(t) = n. 

For evaluating the performance of the pipelined process 
shown in Fig. 2, a system model which includes the SCH and 
CPICH signals as illustrated in Fig. 4 is used. Signals Si{t), 
S2{t), and S3{t) represent the normalized P-SCH, S-SCH, and 
CPICH signalsj respectively. These signals are later scaled ac- 
cordingly to have powers of Pi , P2, and P3, respectively. Noise 
ni{t) is used to model the intra-cell interference, which has 
total power P/. This intra-cell interference in our simulations 
is generated by summing up 16 different equal-power physical 
channels, each with spreading factor 64. The spreading codes 
of these interfering channels are chosen to be orthogonal to 
the one used in CPICH. As a result, in flat fading ni(t) is 
orthogonal to the CPICH signal, ssit), after despreading. A 
complex Gaussian process 712 (0 is used to model inter-cell 
interference. Both ni(t) and 712(0 have a variance equal to 1. 
The frequency error term cJ W-t+tf jhe impulse response 
of the fading channel fl(r; t) are the same as those used in Fig. 

Instead of giving the absolute powers of all downlink chan- 
nels, it is often convenient to introduce some power ratios or 
loading factors of these channels; e.g.. the P-SCH loading factor 
is defined as the power ratio between the P-SCH and the SCH. 
Or = Pi/(Pi + P2). The power ratio between the SCH and all 
downlink signals from the same cell is defined as 0 = (Pi + 
Pi) I {P\ + Pa + P3 + P/), and is often referred to as the SCH 
loadii^ factor. Similarly, the CPICH loading fector is defined 
as 7 = P3/(Pi + P2 + P3 + P/). System parameters a, /3, 
and 7 are critical to cell search performance. Finally, a geom- 
etry factor is defined as the ratio between the power of the de- 
sired base station*s signal and the postchannel interference, i.e., 
= (A + P2 + P3 + P/)/Py. a larger value of G indicates 
that the MS is closer to the center of a cell. Typical values for G 
are in the range of -6 to 6 dB. 

Finally, throughout the paper, the P-SCH sequence is repre- 



where c<"'> 6 {1, -1}, / = 0, 1, . . . , 255. m = 0. 1, . . . , 16, 
are the chip values. 

IV. Algorithms for Code and Time Acquisition 
In this section, we study the code and time synchronization 
aspects of cell search. It is assumed that frequency synchroniza- 
tion is not started until code and time synchronization has been 
achieved (see Fig. 2). Thus, during initial cell search, code and 
time synchronization has to be achieved in the presence of a 
rather laiige frequency error. We will consider both a best-case 
scenario — 9. 0 Hz error— and a worst case scenario — a 20 kHz 
error. The best case scenario is close to a practical operation 
condition in idle and active modes (target cell search), where a 
very small frequency error, typically less than 200 Hz, has to be 
maintained. 

A. Stage I: Slot Synchronization 

In stage 1, the P-SCH matched filter is used to detect the 
slot boundary. Due to a low operating signal-to-noise ratio, the 
matched filter outputs have to be noncoherently accumulated 
over many slots to get reliable decision statistics. After accu- 
mulation, a number of candidates are identified as possible slot 
boundary candidates. The receiver block diagram for stage 1 is 
shown in Fig. 5. In our study, we use one sample per chip for 
rfc. and optimum sampling timing. - - - - 

The matched filter output is given by 



255 



(3) 



where, by referring to Fig. 3, the received signal with the P-SCH 
sequence present is 

1=0 

Here oi is the P-SCH loading factor defined in Section III, 
denotes the chip duration, p,, jt = gi{kT^), and includes sam- 
ples of all the interference terms, i.e., S-SCH and n{t). When the 
frequency error is as large as 20 kHz. using (3) would result in 
large coherence loss. This is because the exponential component 
in (4) results in phase rotation in rjb. To alleviate this problem, 
the P-SCH sequence can be treated as a concatenation of a few 
short sequences. The P-SCH matched filter therefore consists 
of filters, each matched to each of the short sequences. The out- 
puts of the short sequence matched filter are combined nonco- 
herently to prevent phase rotations from propagating across dif- 
ferent short sequences matched filters. Fig. 6 illustrates such a 
matched filter architecture. 

The matched filter output according to this architecture is 
given by 



h'-i 



sented by (4"\ c^^\ . . . , and the mth S-SCH sequence 



U denoted by (4"''. -fT^T'-^, 4S). "» = 1, 2, 16. 



(5) 
(6) 
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Fig. 4. The signal model used for evaluating the perfoniiance of the pipelined cell search piocess. 
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where M is the length of each short sequence, and K is the 
number of short sequences such that KM — 256. The length 
of each short sequence, M, determines the performance of slot 
synchronization in the presence of frequency error. A larger 
M results in more phase rotation during matched filtering and 
larger incoherence loss. However, a large M also results in 
better noise suppression and may thus improve performance. 
Therefore, M has to be optimized for different frequency 
errors. In [13], it was shown that the signal-to-noise ratio 
(SNR) for the peak of matched filter output is 



SNR(M) 



(7) 



where Pjsf is the power of the noise. For initial cell search, with a 
frequency error of 20 kHz, M = 64 is the best choice, assuming 
that M is restricted to a power of 2. For target cell search, with 
a frequency error typically less than 200 Hz, the best choice is 
M = 256. in the following, we use M = 64 for the initial cell 
search and M = 256 for the taiget cell search. 

The performance of stage 1 with 15 slots (10 ms) synchro- 
nization time in flat fading channels is shown in Fig. 7. Slot syn- 
chronization error rate, defined as the probability that none of 
the detected candidates coincides with the slot boundary of any 
resolvable multipath, is plotted versus SNR. Referring to Fig. 
3, SNR = Pd/Px' Note that P^ includes the power from both 
the P-SCH and the S-SCH. Furthermore, the results in Fig. 7 are 
based on a = 0.5, i.e., the power of Synchronization Channel 
is evenly split between P-SCH and S-SCH and that only one 
candidate is detected {Li = 1). If stage 1 gives more than one 
candidate to stage 2, the performance improves, at the expense 
of higher stage 2 complexity. From Fig. 7, we conclude that 20 
kHz frequency error (initial search) results in significant loss in 
stage t performance, and that the performance improves with 
increasing fading rate. A higher fading rate gives rise to faster 
received level variation, and thus less time correlation between 
fading channel realizations. When there is less time correlation 
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between fading channel realizations, better time diversity gain 
can be achieved when accumulating the matched filter output 
over more than one slot. 

B. Stage 2: Frame Synchronization and Code Group 
Identification 

After achieving slot synchronization, S-SCH can be easily 
found. In stage 2, the receiver operations start with correlating 
the received signal of SCH with all 16 S-SCH sequences, and 
then accumulates S-SCH correlations over Nt slots according 
to the 64 RS codewords used, each with 15 hypothesized frame 
boundaries. The total number of hypotheses is therefore 960 (64 
codewords times 15 shifts). At the end, the hypothesis with the 
largest accumulated metric is chosen as the candidate for frame 
boundaiy-code group pair, which is given to stage 3 for scram- 
bling code identification. Due to the frequency error and fading, 
there is.a considerable amount of phase rotation in one slot du- 
ration. Thus, noncoherent accumulation has to be used for com- 
bining S-SCH correlations from slot to slot, if such a phase ro- 
tation is not corrected. However, since P-SCH is always trans- 
mitted along with S-SCH, the P-SCH correlation can be used as 
a phase reference to phase-correct the S-SCH correlations. We 
address both noncoherent and coherent detection below. 

1) Noncoherent Detection: Let rf^ be the jth sample of the 
Arth received SCH symbol, where j = 0, 1, 2, ... , 235 (i.e., 
for each slot there are 256 samples), and Jfc = 0, 1, . . . , JV^ - 1. 
Furthermore, let w^ij) be the j^h symbol of the RS codeword 
associated with code group k = 1, 2, . . . , 64, and j ~ 
0, 1, . . . , 14. Symbol wi,{j) takes values in {1, 2, ... , 16}, 
with each symbol value mapped onto an S-SCH sequence. The 
algorithm for stage 2 consists of the following steps. 

i) For received slot A, calculate S-SCH conelations 



255 



m=l,2, ...,16, 



& = 0. 1, ...,N,-l. (8) 

Note that there are leNt correlations in total. The sum- 
mation in (8) assumes that the frequency error during this 
stage is small so that coherent combining over 256 chips 



is possible. As mentioned earlier, for a 20 kHz frequency 
error, it is better to treat each S-SCH sequence as 4 short 
sequences, each of 64 chips. Coherent correlation is done 
on a short sequence, and the correlation values of short 
sequence are combined noncoherently. For this case, (8) 
is modified as 



64(/+l)-l 



(9) 



ii) Accumulate S-SCH correlation values. Since the RS 
codewords are periodically repeated in every fi^me, if 
A^t > 15 (i.e., synchronization time is longer than one 
frame), the correlation values corresponding to the same 
S-SCH sequence in different frames can be combined. 

(j mod I5)=k 

m = l, ...,16, A; = 0, ...,14. (10) 

iii) RS code decoding. The brute-force approach is to corre- 
late Sm(k) with each RS codeword and with each cyclic 
shift. 

14 

Xi (m) = ^ S^^ „^ 1 - J (k) , 

m = 0,...,14, t = l,...,64. (11) 

The metric Xi(m) is associated with the hypothesis of 
code group i and frame timing m. 

iv) Find the maximum Xi{m). The code group number can 
be found by 



i — arg max Jf, (m) 



(12) 



and the slot number corresponding to the first received 
slot in this stage is 



m = arg max Xi(m). 



(13) 



The pair (i, m) is given to stage 3 for identification of 

the primary scrambling code. 
2) Coherent Detection: For coherent detection, the P-SCH 
is used for deriving a channel estimate. The channel estimate is 
used to phase-correct the S-SCH correlations before combining. 
With this approach. (8) can be modified to be 

Sm(k)=:Bje{C^{k)C^ik)}, 

m = l,...,16,A; = 0,...,Art-l (14) 



where 



255 

C,(k) = 2] rf )<=<'). 



(15) 



Note that in (14), Co(k) corresponds to the P-SCH correlation, 
and C^(^') corresponds to the correlation with the mth S-SCH 
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Fig. 8. Stage 2 perfonnance (A't = 15, a s 0.5, flat fiding, 9.26 Hz 
Doppler). 

sequence. Again, the siimmation in (15) assumes that the fre- 
quency error during this stage is small. If the frequency error is 
as laige as 20 kHz, the following equations can be iised instead. 

3 

Sm{k) = Re {Co^|(^)C«.,(fc)} (16) 

where 

Cm^m^f^r^^+A^ (17) 
i=o 

In ( 1 6), the partial summation of 64 chips is first phase corrected 
before added. The remaining steps for coherent detection are the 
same as those described previously for noncoherent detection. 

The performance of stage 2 in a flat fading channel with 9.26 
Hz Doppler is shown in Fig. 8, for both coherent and nonco- 
herent detection, and for 0 Hz and 20 kHz frequency errors. 
Again, the power of Synchronization Channel is evenly split be- 
tween P-SCH and S-SCH. The synchronization error rate here 
is defined as the probability of stage 2 failing to detect the right 
code group number and frame boundary, given the correct slot 
boundary is provided. From Fig. 8, we see that coherent detec- 
tion achieves much better performance than noncoherent detec- 
tion, especially during initial search when the frequency error 
is large. However, coherent detection relies on the P-SCH cor- 
relation to get a phase reference. In W-CDMA, all the cells use 
the same P-SCH sequence. When the P-SCH of difTerent cells 
are time-aligned, the channel estimate obtained through P-SCH 
correlation will not be accurate. This potential problem with co- 
herent detection will be addressed in the next section. Moreover, 
compared to stage 1 perfonnance, stage 2 perfonnance is much 
better, when coherent detection is used. 

C. Stage 3: Scrambling Code Identification 

After identifying the scrambling code group and achieving 
frame synchronization, the downlink primary scrambling code 
can be identified by correlating the CPICH with all possible 
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Stage 3 Detection Thresholds 
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scrambling codes in the identified code group. The number of 
scrambling codes in the identified code group, iVi.Ci depends 
on the operating scenario. For initial search, TVlc = 8, whereas 
for idle and active mode search, Ni^c is typically 1 or 2, given 
a good system-wide code planning [8]. One of the main de- 
sign considerations in this stage is the probability of false de- 
tection. According to Fig. 2, stage 3 accepts a candidate only 
when the detection metric is greater than a threshold, where 
the threshold is predetermined to achieve a certain false de- 
tection probability. For initial search, once a scrambling code 
is "detected," the processes of frequency acquisition (stage 4) 
and eventually reading the broadcast information (stage 5) are 
started. For idle and active mode search, a detected scrambling 
code will be time-tracked and the signal-to-interference ratio 
(SIR) thereof will be measured and reported back to the base 
station. To minimize unnecessary MS activities, it is thus desir- 
able to have a very low false detection probability during stage 3. 

The detection scheme considered in this paper is to correlate 
each received CPICH symbol with JVlc hypothesized scram- 
bling codes. The correlation is performed with 256 or 64 chips of 
coherent combining depending on the frequency error scenario 
as discussed earlier. For each CPICH symbol, the scrambling 
code with the largest correlation value is chosen. At the end, 
afler collecting lONt symbols (i.e., Nt slots with 10 symbols 
per slot), a majority vote is used to determine the most likely 
scrambling code. The detected scrambling code is accepted only 
when the detection metric (number of votes) associated with 
the detected scrambling code is greater than the predetermined 
threshold. The number of symbols used in stage 3 is J = lONt, 
and let X) be the detection threshold. Using the union bound, the 
false detection (false alann) probability can be approximated by 

(18) 

According to our study, this union bound is very tight for /Va ^ 
0.01. Using (18), the detection threshold D can be determined. 
Table I lists the values of D in various scenarios. 

Fig. 9 shows the performance of stage 3 in a flat fading 
channel with 9.26 Hz Doppler and 10 ms synchronization time 
(Nt = 15). The target false detection probability is 10~*, and 
the miss detection probability is defined as the probability 
of stage 3 failing to detect a scrambling code given it starts 
with a correct frame boundary and code group number. This 
occurs when the detection metric is less than the predetermined 
threshold £>. The two curves in Fig. 9 correspond respectively 
to the initial cell search (dash line) and target cell search (solid 
line) scenarios. 
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V. Performance of Pipelined Process and Parameter 
Optimization 

In this section, the performance of the three-stage pipelined 
cell search process illustrated in Fig. 2 is studied. Both ini- 
tial and target cell search scenarios are considered. For initial 
search, the MS has to search through all 512 possible primary 
scrambling codes, and as long as it can identify one valid scram- 
bling code, the search process is considered a success. In target 
cell search scenarios, we assume that the MS is given a list of 
taiget scrambling codes from the serving base station. In our 
study, we consider that the target scrambling codes are dis- 
tributed among 16 code groups, and each code group can have at 
most two target scrambling codes. The acquisition time during 
taiget search is defined as the time required to identify a desired 
target code among the list of all target scrambling codes. The 
algorithm described in Section IV-C is used to determine if a 
code is identified reliably, with a false detection probability of 
10-^. 

Fig. 10 shows the average acquisition time in a flat fading 
channel with 9.26 Hz Doppler. The system design parameters 
defined in Section III are set to be (a, 7) = (O.o, 0.1, 0.1), 
and the synchronization time per stage is Nt — 15 slots (i.e., 
10 ms). It can be seen that during initial search when the MS 
is at the cell edge (i.e., lower G values), using coherent detec- 
tion in stage 2 significantly reduces the average acquisition time. 
The improvement is seen as 15-40% reduction in average ac- 
quisition time compared to using noncoherent detection when 
-6 < C? < 0 dB. For target search, the benefit of using co- 
herent detection in stage 2 seems small. 

Next we compare the performance of the pipelined process to 
a serial process similar to the one used in [6]. The serial search 
process is illustrated in Fig. 1 1, which has 30 ms synchroniza- 
tion time in stage 1 , 20 ms synchronization time in stage 2, and 
10 ms synchronization time in stage 3. It can be seen that the 
pipelined search process considered achieves faster acquisition 
time (Fig. 12). 

As mentioned in Section IV-B, coherent detection in stage 2 
needs to obtain a phase reference from the P-SCH correlation. 



In rare scenarios where two SCH signals are slot-aligned, the 
phase reference may be deteriorated, and thus may have nega- 
tive impact on the cell search perfonnance. To study this further, 
we use a two-cell simulation model. In addition to the signals 
shown in Fig. 4, another set of signals for a second cell is gener- 
ated, including a P-SCH, an S-SCH, a CPICH, and an intra-cell 
interference signal. These signals are of the same power as their 
counterparts in the first cell. Signals of the same cell are faded 
together just as illustrated in Fig. 4, whereas signals of different 
cells are faded independently. Fig. 13 shows the average acqui- 
sition time for initial search in the slot-aligned scenario. For 
comparison, the performance when the two SCH signals are not 
slot-aligned is also shown. The system and receiver parameter 
setting is the same as those used in Fig. 10, and the variable G' is 
defined as the ratio between the total power of the first cell and 
Px. Since the two cells have the same total transmitted power, 
the geometry factor of each of the cells can be related to G* by 
G = 1/(1 + (G')-^). From Fig. 13, it is found that using co- 
herent detection in stage 2 maintains its performance advantage 
over noncoherent detection, even when the two SCH signals are 
slot-aligned. 

To study the average acquisition time in realistic channel 
conditions, three W-CDMA test channels— CASE I (Indoor, 3 
km/h), CASE II (Indoor to Outdoor and Pedestrian, 3 km^), 
and CASE III (Vehicular, 120 km/h) [14]— are used. Table 
II lists the multipath delay profile of these test channels. The 
average acquisition time for initial search is shown in Fig. 14. 
For comparison purposes, the performance in a flat fading 
channel with 9.26 Hz Doppler is also shown. It is found that 
for initial search, the CASE III channel requires the longest ac- 
quisition time. This is because the lai^ge frequency error during 
initial search results in significant signal loss; and in a highly 
dispersive channel such as the CASE III channel, the eneigy of 
each resolvable multipath becomes extremely weak. Although 
CASE III has better path diversity (more resolvable multipaths) 
and time diversity (higher vehicular speed), the perfomiance 
is essentially limited by tfie weak signal strength of each 
multipath. However, from Fig. 14, it can be seen that when the 
mobile is closer to the center of a cell (larger geometry &ctor G 
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and stronger desired signal), the performance in the CASE III 
channel is very close to the other test channels. This is because 
as the signal strength of each multipath becomes stronger, 
path and time diversity becomes more helpful. Moreover, for 
G > — 3 dB, the average acquisition time during initial search 
is less than 1 00 ms in all W-CDMA test channels. 

The (performance of target search is shown in Fig. 15. It can 
be seen that a target can be identified within 42 ms in all test 
channels of interest, given its geometry factor is greater than —6 
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Fig. 14. Average acquisition time for initial search in various W-CDMA test 
channels (.V, = 15,rt = 0.5, a? = 0.1,7 = 0.1, 20 kHz frequency error). 

dB. Similar to the case of initial search, path and time diversity 
helps when the desired signal is strong; and when the desired 
signal is weak, higher dispersiveness results in weaker signal 
energy per multipath, which limits the performance. 

An interesting optimization problem for the Synchronization 
Channel is the power ratio, a, between the P-SCH and the Syn- 
chronization Channel. Table HI shows the average acquisition 
time in the CASE III test channel for initial and target search, 
respectively- The geometry factor is fixed as G = —3 dB, 
and coherent detection is used in stage 2 in both cases. From 
these results, we conclude that a good choice of or is approx- 
imately 0.6-0.7 for the pipelined process considered. This is 
because stage 2 performance is much better than stage I per- 
formance. Thus, it is advantageous to increase the power allo- 
cation for P-SCH. However, it should be noted that for other 
cell search processes such as the serial search process shown in 
Fig. 1 1, which has different averaging time among the different 
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Stages, the best choice of a might be different. Nevertheless, the 
pipelined process considered is shown to be superior to the se- 
rial search process; therefore, to achieve the fastest acquisition 
time, the system should optimize the parameter setting for the 
pipelined process. 

The CPICH loading factor (7) and the SCH loading factor (/?) 
are also critical to cell search perfonnance. Higher values of 7 
and 0 result in faster acquisition, at the expense of higher inter- 
ference to other physical channels in the same cell. On the other 
hand, the system can enhance the in-cell capacity by tolerating 
longer acquisition time. Table IV shows the average acquisition 
time as a function of the CPICH loading factor. It is seen that 
there is not much performance improvement by increasing the 
CPICH loading factor beyond 10%. However, when the CPICH 
loading factor is less than 5%, there is significant penalty in the 
cell search performance. Since in W-CDMA CPICH could also 
be used for channel estimation, it should be noted that when de- 
ciding the CPICH loading factor, the performance of channel 
estimation also needs to be taken into account. Table V shows 
the acquisition time as a function of the SCH loading factor. It 
is seen that when the SCH loading factor is below 10%, there is 
severe degradation in cell search performance. Since SCH has 
a duty factor of 10%, effectively it only uses total trans- 
mitted power of a cell on the average. Thus, lowering /? below 
1 0% does not seem to be an attractive system design choice. 
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TABLE V 

Average Acquisition Time Versus the SCH Loading Factor 3 (Case III 
Test Channels. .Vi = 15, n- = 0.5, /? = O.l, G = -3 ds 
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VI. Frequency Acquisition (Stage 4) 

In this section, the problem of frequency acquisition is 
studied. Without loss of generality, we assume that the modu- 
lation values for the pilot symbols in CPICH arc all Ts. Using 
the signal model shown in Fig. 3. the CPICH signal can be 
expressed as 



(19) 



where Oi is the tth chip of the scrambling code, and p(t) is the 
chip waveform. Similar to (4), the received signal sample can 
be expressed as (assuming a one-path channel) 

n- = v/^.9jl.afce^«2-^-*'-^*+*) + nit (20) 

where = go, k and tq is set to 0. Although the frequency error 
detector developed below is based on a one-path channel model, 
it can be easily extended to cases with dispersive channels. 

The first step of frequency acquisition is to despread the 
CPICH signal as 



(21) 



1=0 



where TNT is the despreading factor, and a* stands for the com- 
plex conjugate of a. Assume that at^al — 1, and the channel 
coefficient, ^J,, is constant over N chips. It follows that 



iV-l 



=5^/vC^^*^-*^^'c^* E (22) 

1=0 

==5. (c^-^a.))* ei^'iAif,) + n,) (23) 
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Where Qi = |pj,vl. = 2« /^Trf, fn = J2'iLo^ ^rf+t. and 

Aife)=Yl^'''^'"''' (24) 

The constant Td = NTc is the duration of a despread symbol, 
and rti is the white Gaussian component with variance a^. 
Note that the phase of the complex channel coefficient g'^j^ 
is folded into the random phase term 9*. Given Ld observ- 
ables, y = (j/o, yc^-i), the log-likelihood function for 
detecting the frequency error, conditioned on coefficients 
5 = (po, ffi, • . . , 9L^-i)f is given by [13] 

zi = lnp(i/|/« = /<, 5) 

= l"^o(^^ • l^'^l) - IPil^ (25) 

where 

/=0 

and = i/ifif*. Note that in (26), Yi is simply the Discrete 
Fourier Transform (DFF) of = (j/o, . . . , at irequency 

/,-. Given Te^ N, and fu the values of ^(/i) can be precal* 
culated using (24). The ML detector for frequency error is 
thus finding a hypothesized frequency error fi which maximizes 
Zi, within the frequency uncertainty region [— /max, /max]- The 
magnitude of the largest possible frequency error, fmtxt de> 
pends on the choice of crystal oscillator. 

In practice, one cannot have an infinite number of hypotheses; 
instead a finite number Kh of hypotheses is used. It is preferred 
that Kh is a power of 2, because for such Kh, the Fast Fourier 
Transform (FFT) can be used. However, for a finite Khy the 
accuracy of finequency detection is limited by the resolution of 
FFT. For a despread symbol duration Td, the frequency resolu- 
tion is A/ = l/Td/Kht and the hypothesized frequency error 
is 

^ r(t-l)A/ l<i<KH/2 + l 

^* \-{Kh-i-\-l)Af Kh/2-hl<i<KH. 

From (25), the decision statistic a: = (zi, Z2, . . . , Zh\ ) for all 
Kh hypotheses in AWGN can be expressed as 

where a = (|>l(/o)|, |>l(/i)|, . . . , |^(/fc j|)^. and the 
column vector Y is the jRTa -point FFT of y. In (27), if elements 
in vector a is approximated by >!(/•) » >l, Vt, a simpler 
suboptimal detector can be derived: 

= \Yl (28) 

This suboptimal detector in AWGN is simply the FFT of the de- 
spread values. It was shown in [13] that the suboptimal detector 



achieves performance almost identical to the ML detector. Fur- 
thermore, this suboptimal detector is in fact the same as the ML 
frequency error estimator for a nonspread signal presented in 
[IS] and [16]. 

As mentioned earlier, with finite Kd, the accuracy of the 
/Cf-Hypothesis ML detector is limited by the frequency resolu- 
tion A/ of the FFT algorithm. To improve the accuracy beyond 
this limitation, the idea of taking the derivatives of the log-like- 
lihood function with respect to /« was proposed in [16] and 
[17]. Here we propose another alternative based on performing 
quadratic interpolation in the frequency domain. This algorithm 
first finds the largest detection statistic zj and its corresponding 
frequency hypothesis /j, and then determines a quadratic curve 
z = wi/^ + V2f -f t;3, such that the points (/j_i, 4fj-i), 
ifjt ^j). and (/j+i , zj+i) are on the quadratic curve. It can be 
shown that the peak of the quadratic curve occurs at frequency 
(and thus our refined estimate) 

A. Proposed Scheme 

Based on the interpolated ifh-Hypothesis ML detector, the 
following scheme is proposed for achieving initial frequency 
acquisition in W-CDMA. 

The proposed scheme consists of: 1) despreading the CPICH 
signal using a despreading factor TVT = 64; 2) collecting L = 40 
despread values per slot;' 3) removing the modulation of these 
despread values using the knowledge of the pilot symbols; 4) 
calculating the detection statistics z = (zi, zj, Z64) using 
a 64-point FFT {Kh = 64), after zero-padding; 5) accumu- 
lating the detection statistics z over Nj slots; and 6) performing 
quadratic interpolation to find the estimated frequency error /«. 
With a chip rate of 3.84 Mchips/s, the frequency resolution of 
the 64-point FFT used in this scheme is therefore 937.5 Hz. 

For comparison, we also study the performance of a differen- 
tial detection (DD) scheme. In this approach, the CPICH signal 
is first despread by also using a despreading &ctor of 64 chips. 
After removing the modulation of the pilot symbols, each de- 
spread value is multiplied by the complex conjugate of its prece- 
dent despread value to get a metric relating to the phase rota- 
tion between two successive despread values. The metrics are 
then accumulated over Nf slots, and at the end, the accumulated 
metric is converted to the corresponding phase error and fre- 
quency error. Such a differential detector based scheme suffers 
from noise enhancement, especially when the signal-to-noise 
ratio of the despread value is low, e.g., less than 0 dB. 

The simulation results for a flat fading channel are shown in 
Fig. 16. The synchronization time used is 20 ms {Nj = 30) 
and the maximiun frequency error is 20 kHz. Results for both 
slow fading (9.26 Hz) and fast fading (463 Hz) are shown. The 
synchronization error rate is defined as the probability that the 
difference between the estimated frequency error and true fre- 
quency error is more than 200 Hz. One can see that the proposed 

'Since there are 2560 chips in a slot, with a despreading factor of 64, 40 
despread values can be obtained. 
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Fig. 16. Pcrfonnance comparison between the differential detection scheme 
and the proposed scheme in a flat fading channel. (A'/ = 30, = 20 kHz). 

scheme outperforms the DD scheme by more than 1 0 dB. More- 
over, the proposed scheme reduces the frequency error from 20 
kHz to 200 Hz with a very high reliability. Observe that for the 
case with 463 Hz Doppler spread, the proposed scheme even 
achieves frequency accuracy beyond the Doppler spread. 

VII. Conclusion 

Two scenarios of cell search in Wideband CDMA 
(W-CDMA) have been studied: 1) the initial cell search 
when a mobile station is switched on; and 2) the target cell 
search idle and active modes. The frequency errors considered 
are 20 kHz and 0 Hz for the initial and target cell search, 
respectively. The large frequency error results in large in- 
coherence loss during initial search. This problem is solved 
by partial symbol desprrading and noncoherent combining 
between partial symbols. With a maximum frequency error 
of 20 kHz, despreading using 64 chips long symbols instead 
of 256 chips is used. This partial symbol despreading is also 
used in stage 2 with either noncoherent or coherent combining 
using the primary synchronization channel to obtain a channel 
estimate. It is shown that coherent stage 2 detection improves 
the average acquisition time performance significantly, even in 
the slot-aligned scenario, where the synchronization channels 
from two cells are perfectly aligned. Optimization of key 
system parameters such as Primary Synchronization (P-SCH) 
Channel, Synchronization Channel (SCH), and Common Pilot 
Channel (CPICH) loading factors is studied. It is found that 
the best choice for the P-SCH loading factor is approximately 
0.6-0.7 for the pipelined process considered, and the SCH 
loading factor should be no less than 10%. For capacity and 
acquisition time tradeoff, a good choice for the CPICH loading 
factor is in the range of 5% to 10%. Finally, a maximum likeli- 
hood based frequency estimation method based on despreading 
the CPICH signal, the Fast Fourier Transform, and a quadratic 
interpolation in the frequency domain is proposed. It is shown 
that this proposed method achieves more than 10 dB gain, 
compared to a differential detection scheme. 
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